Model Selection for Estimating the Non Zero Components of a Gaussian Vector

نویسنده

  • Sylvie Huet
چکیده

We propose a method based on a penalised likelihood criterion, for estimating the number on non-zero components of the mean of a Gaussian vector. Following the work of Birgé and Massart in Gaussian model selection, we choose the penalty function such that the resulting estimator minimises the Kullback risk. Mathematics Subject Classification. 62G05, 62G09. Received January 13, 2004. Revised September 28, 2005. Introduction The following regression model is considered: X = m + τε, ε ∼ Nn(0, In), where X = (X1, . . . Xn) is the vector of observations. The expectation of X, say m = (m1, . . . ,mn) , and the variance τ are unknown. Assuming that some of the components of m are equal to zero, our objective is to estimate the number of zero components as well as their positions. We propose an estimation method based on a model choice procedure. We denote by J a subset of Jn = {1, 2, . . . , n} with dimension kJ , and we consider the collection J of all subsets of Jn with dimension less than kn for some kn less than n: J = {J ⊂ Jn, kJ ≤ kn} . Let x = (x1, . . . , xn) , then for each subset J ∈ J we denote by xJ the vector in R whose component i equals xi if i belongs to J and 0 if not. We denote by ‖x‖2 the Euclidean distance in R and we set ‖x‖n = ‖x‖2/n. For each subset J in the collection J , assuming that m = mJ , the maximum likelihood estimators of the parameters (m, τ) are (XJ , ‖XJc‖n), where J denotes the complement of J in Jn. We thus define a collection of estimators, (XJ , ‖XJc‖n) and the problem is now to choose an estimator of (m, τ) in this collection, or equivalently, to choose the best J in J , say Ĵ , and to take (m̂, τ̂) = (XĴ , ‖XĴc‖n). We associate to each estimator in the collection a risk defined as R(J) = E {K(m,τ2) (XJ , ‖XJc‖2n)} ,

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Negative Selection Based Data Classification with Flexible Boundaries

One of the most important artificial immune algorithms is negative selection algorithm, which is an anomaly detection and pattern recognition technique; however, recent research has shown the successful application of this algorithm in data classification. Most of the negative selection methods consider deterministic boundaries to distinguish between self and non-self-spaces. In this paper, two...

متن کامل

Novel Radial Basis Function Neural Networks based on Probabilistic Evolutionary and Gaussian Mixture Model for Satellites Optimum Selection

In this study, two novel learning algorithms have been applied on Radial Basis Function Neural Network (RBFNN) to approximate the functions with high non-linear order. The Probabilistic Evolutionary (PE) and Gaussian Mixture Model (GMM) techniques are proposed to significantly minimize the error functions. The main idea is concerning the various strategies to optimize the procedure of Gradient ...

متن کامل

بهسازی گفتار با تخمین گر کمترین میانگین مربعات خطا برپایه توزیع مخلوط لاپلاس برای گفتار

In this paper an estimator of speech spectrum for speech enhancement based on Laplacian Mixture Model has been proposed. We present an analytical solution for estimating the complex DFT coefficients with the MMSE estimator when the clean speech DFT coefficients are mixture of Laplacians distributed. The distribution of the DFT coefficients of noise are assumed zero-mean Gaussian.The drived MMSE...

متن کامل

Designing a Model for Estimating the Value of Human Assets, based on the Non-monetary Valuation Approach in Guilan University of Medical Sciences

Abstract Introduction: The ability to record the value of human assets in a financial statement is one of the requirements of modern human resource management. Objective: Design a model for estimating the value of human assets using a non-monetary valuation approach in Guilan University of Medical Sciences and prioritizing its indicators. Materials and Methods: The research method was combin...

متن کامل

Analysis of Vector Estimating Modulation Method to Eliminate Common Mode Voltage

Abstract The problem of common mode voltage in inverters can be considered as a major issue which leads to motor bearing failures. To eliminate these voltages, proposing some methods seems to be necessary. This paper has a comparative study on estimating modulation methods of eliminating common mode voltage. The main idea of these methods is based on generation of reference vector with nearest ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006